Overview

Dataset Statistics

Number of Variables 10
Number of Rows 140
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 22.2 KB
Average Row Size in Memory 162.4 B
Variable Types
  • Categorical: 4
  • Numerical: 6

Dataset Insights

Turnover_lay and Turnover_2012 have similar distributions Similar Distribution
Turnover_lay and Total_assets_2012 have similar distributions Similar Distribution
Turnover_2012 and Total_assets_2012 have similar distributions Similar Distribution
Patent_count is skewed Skewed
Turnover_lay is skewed Skewed
Turnover_2012 is skewed Skewed
Total_assets_2012 is skewed Skewed
Employees_2012 is skewed Skewed
R&D_2012 is skewed Skewed
ID has a high cardinality: 127 distinct values High Cardinality
University has constant value "0" Constant
Patent_industry has constant length 1 Constant Length
University has constant length 1 Constant Length
Country_code has constant length 3 Constant Length
  • 1
  • 2

Variables

ID

categorical

Approximate Distinct Count 127
Approximate Unique (%) 90.7%
Missing 0
Missing (%) 0.0%
Memory Size 12.4 KB

Length

Mean 24.7429
Standard Deviation 9.8903
Median 22
Minimum 6
Maximum 66

Sample

1st row JSR CORPORATION
2nd row Toppan Printing Co...
3rd row BP Chemicals Limit...
4th row Sharp Kabushiki Ka...
5th row Bridgestone Corpor...

Letter

Count 2996
Lowercase Letter 2279
Space Separator 336
Uppercase Letter 717
Dash Punctuation 6
Decimal Number 1

Patent_industry

categorical

Approximate Distinct Count 5
Approximate Unique (%) 3.6%
Missing 0
Missing (%) 0.0%
Memory Size 9.0 KB
  • The largest value (4) is over 3.71 times larger than the second largest value (3)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 140
  • The top 2 categories (4, 3) take over 50.0%
  • The largest value (4) is over 3.71 times larger than the second largest value (3)
  • Patent_industry has words of constant length

University

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 9.0 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 140
  • University has words of constant length

Patent_count

numerical

Approximate Distinct Count 102
Approximate Unique (%) 72.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.2 KB
Mean 6673.1357
Minimum 1
Maximum 204120
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Patent_count is skewed right (γ1 = 6.0891)

Quantile Statistics

Minimum 1
5-th Percentile 3
Q1 20
Median 325
Q3 1493.25
95-th Percentile 20118
Maximum 204120
Range 204119
IQR 1473.25

Descriptive Statistics

Mean 6673.1357
Standard Deviation 27261.5772
Variance 7.4319e+08
Sum 934239
Skewness 6.0891
Kurtosis 38.8672
Coefficient of Variation 4.0853
  • Patent_count is not normally distributed (p-value 6.23224576106292e-25)
  • Patent_count has 23 outliers

Turnover_lay

numerical

Approximate Distinct Count 102
Approximate Unique (%) 72.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.2 KB
Mean 3.9204e+07
Minimum 1195
Maximum 2.7667e+08
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Turnover_lay is skewed right (γ1 = 2.6569)

Quantile Statistics

Minimum 1195
5-th Percentile 63729
Q1 3.31e+06
Median 1.1976e+07
Q3 4.8105e+07
95-th Percentile 2.4238e+08
Maximum 2.7667e+08
Range 2.7667e+08
IQR 4.4795e+07

Descriptive Statistics

Mean 3.9204e+07
Standard Deviation 6.5386e+07
Variance 4.2754e+15
Sum 5.4885e+09
Skewness 2.6569
Kurtosis 6.6635
Coefficient of Variation 1.6679
  • Turnover_lay is not normally distributed (p-value 9.83031129262479e-19)
  • Turnover_lay has 10 outliers

Turnover_2012

numerical

Approximate Distinct Count 102
Approximate Unique (%) 72.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.2 KB
Mean 3.7164e+07
Minimum 0
Maximum 3.7712e+08
Zeros 1
Zeros (%) 0.7%
Negatives 0
Negatives (%) 0.0%
  • Turnover_2012 is skewed right (γ1 = 2.774)

Quantile Statistics

Minimum 0
5-th Percentile 31388
Q1 2.6961e+06
Median 9.0449e+06
Q3 3.8034e+07
95-th Percentile 2.3435e+08
Maximum 3.7712e+08
Range 3.7712e+08
IQR 3.5338e+07

Descriptive Statistics

Mean 3.7164e+07
Standard Deviation 6.2475e+07
Variance 3.9031e+15
Sum 5.203e+09
Skewness 2.774
Kurtosis 8.5466
Coefficient of Variation 1.681
  • Turnover_2012 is not normally distributed (p-value 4.617163228629315e-20)
  • Turnover_2012 has 20 outliers

Total_assets_2012

numerical

Approximate Distinct Count 102
Approximate Unique (%) 72.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.2 KB
Mean 5.017e+07
Minimum 2176
Maximum 3.7688e+08
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Total_assets_2012 is skewed right (γ1 = 2.6742)

Quantile Statistics

Minimum 2176
5-th Percentile 104260
Q1 2.4695e+06
Median 1.1567e+07
Q3 4.6741e+07
95-th Percentile 3.0429e+08
Maximum 3.7688e+08
Range 3.7688e+08
IQR 4.4272e+07

Descriptive Statistics

Mean 5.017e+07
Standard Deviation 8.914e+07
Variance 7.9459e+15
Sum 7.0238e+09
Skewness 2.6742
Kurtosis 6.7524
Coefficient of Variation 1.7768
  • Total_assets_2012 is not normally distributed (p-value 1.1021865022157869e-19)
  • Total_assets_2012 has 20 outliers

Employees_2012

numerical

Approximate Distinct Count 102
Approximate Unique (%) 72.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.2 KB
Mean 85729.1714
Minimum 12
Maximum 434246
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Employees_2012 is skewed right (γ1 = 1.4884)

Quantile Statistics

Minimum 12
5-th Percentile 115.55
Q1 8621
Median 30697
Q3 132276
95-th Percentile 333498
Maximum 434246
Range 434234
IQR 123655

Descriptive Statistics

Mean 85729.1714
Standard Deviation 111190.1856
Variance 1.2363e+10
Sum 1.2002e+07
Skewness 1.4884
Kurtosis 1.065
Coefficient of Variation 1.297
  • Employees_2012 is not normally distributed (p-value 4.0316407663094057e-16)
  • Employees_2012 has 15 outliers

R&D_2012

numerical

Approximate Distinct Count 102
Approximate Unique (%) 72.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.2 KB
Mean 1.8136e+06
Minimum 1211
Maximum 1.0148e+07
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • R&D_2012 is skewed right (γ1 = 1.6058)

Quantile Statistics

Minimum 1211
5-th Percentile 10680.65
Q1 110064
Median 555631
Q3 3.4056e+06
95-th Percentile 8.5763e+06
Maximum 1.0148e+07
Range 1.0147e+07
IQR 3.2955e+06

Descriptive Statistics

Mean 1.8136e+06
Standard Deviation 2.5438e+06
Variance 6.471e+12
Sum 2.539e+08
Skewness 1.6058
Kurtosis 1.6465
Coefficient of Variation 1.4027
  • R&D_2012 is not normally distributed (p-value 7.909837105990644e-18)
  • R&D_2012 has 9 outliers

Country_code

categorical

Approximate Distinct Count 6
Approximate Unique (%) 4.3%
Missing 0
Missing (%) 0.0%
Memory Size 9.3 KB
  • The largest value (4.0) is over 2.97 times larger than the second largest value (1.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 4.0
2nd row 4.0
3rd row 2.0
4th row 4.0
5th row 4.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 280
  • The top 2 categories (4.0, 1.0) take over 50.0%
  • The largest value (40) is over 2.97 times larger than the second largest value (10)
  • Country_code has words of constant length

Interactions

Correlations

Missing Values